CDS

Accession Number TCMCG008C20871
gbkey CDS
Protein Id XP_020226977.1
Location complement(join(32202994..32203224,32203307..32203450,32203538..32203625,32204467..32204572,32205172..32205326,32205651..32205720,32212104..32212270,32213839..32213906,32214190..32214293,32214370..32214540,32215095..32215232,32215937..32216120,32216222..32216515))
Gene LOC109808393
GeneID 109808393
Organism Cajanus cajan

Protein

Length 639aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA376605
db_source XM_020371388.2
Definition serine protease SPPA, chloroplastic [Cajanus cajan]

EGGNOG-MAPPER Annotation

COG_category OU
Description protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko01000, ko01002
KEGG_ko ko:K04773
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCACGCACTCGCATTGCTGCTATTCACCGCTTACGCTACAGCTTCACCTACACCGCATCATTATCTGCAACTACTCTCTCTCGCTCTCAATTCCAATGTCATGGTTATTCTTCTTCCGCCCCCAATCAAGGCCTAGTCGGTGGCGGCCACGAGCATTACCCCACTGGAGACTTTGATTTCAACCCTATCACAGGATGGAAAAAAAGTATCGTCAAGCTCAAGATGCTAACAGCCTGGCCCTGGGAGCGTCTCCGATACGGCACCGTCTTCACTGTCAAGTTGCGCGGCCAGATTTCGGATCAGCTCAAGACTAGATTCTCTCCGGGATTATCTCTGCCTCAAATTTGTGATAATTTCTTGAAGGCGGCTTATGATCCTCGAATTTCCGCCATCTATCTTCACATTGATATTTTAAACTGCGGTTGGGCCAAGGTCGAAGAAATTCGAAGGCACATCTTGAATTTCAGAAAATCAGGGAAATTTATTGTGGCTTACGTCCCTTCATGTCGAGAAAAAGAATACTATATTGCATGTGCCTGTGAAGAGATATATGCCCCTCCAAGTGCTTATTTTTCTTTGTTTGGATTGACTGTTCAAGCCCAATTCGTCAGAGGTGTTTTGGAGAATATTGGAATTGAACCACAATTGGAAAGGATTGGCAAATACAAAAGTGTAGGAGATCAACTAACCCGTAGAACCATGTCTGAAGATCATCATGAGATGCTGACTTCATTGCTTGATAACATCTATACAAATTGGTTGGACAAAGTCTCTTCTGCTAGAGGAAAGAAAAGAGAAGATATTGAGAATTTCATAAATGAAGGTGTTTATCAAGTAGAGAGGCTTAAAGAAGATGGCTTCATATCAGACATAATGTATGACGATGAGGTTATCACTAGGTTGAAGGAGAGACTTCAAGTGAAAACAAATAAAAATCTGCCTATGGTTAATTACAGAAAATACTCTGGAGTCAGGAAATCAACTCTTGGACTATCAGGTGGTAAAGATTTAATAGCCATCATCCGAGCTTCAGGGAGTATTCGTCGTATCGAGAGTCCATTAAGTTCCCGTAGCTCAGGTATCATTGGAGAGAAGTTCATTGAGAAGATACGCAGGGTTAGAGAGTCAAATAAATATAAGGCAGCTATTATCCGAATTGACAGTCCAGGAGGTGATGCTCTTGCTTCCGATTTGATGTGGAGAGAAATCAGGCTTTTGGCTGCCTCAAAACCCGTCATTGCTTCAATGTCTGATGTGGCAGCAAGTGGAGGGTACTACATGGCAATGGGGGCAGGAGCTATTGTTGCAGAAAGTCTTACCTTAACTGGTTCAATTGGAGTGGTCACAGGAAAATTTAACCTTGGGAAACTTTATGAGAAGATTGGCTTCAACAAAGAAATTATATCGAGGGGTAGATATGCTGAGCTCCGTGCAGCTGAACAGCGTTCTTTTAGACCAGATGAAGCAGAGCTATTTTCCAAGTCTGTGCAACATGCTTATAAACAATTTCGAGACAAGGCTGCCGTTTCTCGATCAATGAGTGTAGACAAGATGGAAGAGGTTGCACAGGGAAGGGTTTGGACTGGTAAGGACGCAGCTTCTCATGGTTTGATTGATGCTATTGGTGGTCTTTCTCGAGCTGTTGCCATTGCAAAATTGAAGGCCAATATACCTCAAGACAGACAGGTTACTATTGTGGAGCTCTCGAGACCCAGCCCTACTCTGCCCGAGATTTTAAGTGGTCTAGGTAATTCTCTCGTTGGAGTAGACACAACTTTAAAGGAATTATTACAGGACCTGACAGTTTCCCATGGAGTCCAAGCACGAATGGATGGGATCATGTTTGAGAAATTGGAAGGAAATCCACACGCCAACCCCATTTTGACATTAATTAAAGATTATCTTAGTTCCCTCTAG
Protein:  
MSRTRIAAIHRLRYSFTYTASLSATTLSRSQFQCHGYSSSAPNQGLVGGGHEHYPTGDFDFNPITGWKKSIVKLKMLTAWPWERLRYGTVFTVKLRGQISDQLKTRFSPGLSLPQICDNFLKAAYDPRISAIYLHIDILNCGWAKVEEIRRHILNFRKSGKFIVAYVPSCREKEYYIACACEEIYAPPSAYFSLFGLTVQAQFVRGVLENIGIEPQLERIGKYKSVGDQLTRRTMSEDHHEMLTSLLDNIYTNWLDKVSSARGKKREDIENFINEGVYQVERLKEDGFISDIMYDDEVITRLKERLQVKTNKNLPMVNYRKYSGVRKSTLGLSGGKDLIAIIRASGSIRRIESPLSSRSSGIIGEKFIEKIRRVRESNKYKAAIIRIDSPGGDALASDLMWREIRLLAASKPVIASMSDVAASGGYYMAMGAGAIVAESLTLTGSIGVVTGKFNLGKLYEKIGFNKEIISRGRYAELRAAEQRSFRPDEAELFSKSVQHAYKQFRDKAAVSRSMSVDKMEEVAQGRVWTGKDAASHGLIDAIGGLSRAVAIAKLKANIPQDRQVTIVELSRPSPTLPEILSGLGNSLVGVDTTLKELLQDLTVSHGVQARMDGIMFEKLEGNPHANPILTLIKDYLSSL
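
As a consistency check on the record above, the exon intervals in the Location field can be summed and compared with the stated protein length. The following is a minimal sketch in plain Python, not the database's own tooling; the helper names `parse_location` and `cds_length` are hypothetical and introduced only for illustration. It assumes the usual GenBank feature-location convention of 1-based, inclusive coordinates, and that the CDS includes the terminal stop codon.

```python
import re

# Location field copied verbatim from the CDS section of this record.
LOCATION = (
    "complement(join(32202994..32203224,32203307..32203450,"
    "32203538..32203625,32204467..32204572,32205172..32205326,"
    "32205651..32205720,32212104..32212270,32213839..32213906,"
    "32214190..32214293,32214370..32214540,32215095..32215232,"
    "32215937..32216120,32216222..32216515))"
)

PROTEIN_LENGTH = 639  # "Length 639aa" in the Protein section


def parse_location(location):
    """Return (strand, exons) for a simple complement(join(...)) location.

    Hypothetical helper: handles only plain 'start..end' intervals,
    not partial ('<', '>') or remote locations.
    """
    strand = "-" if location.startswith("complement(") else "+"
    exons = [(int(start), int(end))
             for start, end in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, exons


def cds_length(exons):
    """Sum exon lengths, treating coordinates as 1-based and inclusive."""
    return sum(end - start + 1 for start, end in exons)


strand, exons = parse_location(LOCATION)
total_nt = cds_length(exons)
codons, remainder = divmod(total_nt, 3)

print(f"strand: {strand}, exons: {len(exons)}")
print(f"CDS length: {total_nt} nt = {codons} codons (remainder {remainder})")
# One codon is the stop codon, so the protein has codons - 1 residues.
print(f"matches protein length: {codons - 1 == PROTEIN_LENGTH}")
```

For this record the 13 exons sum to 1920 nt, that is 640 codons: 639 amino acids plus the stop codon (the CDS sequence ends in TAG), in agreement with the Protein section.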